A Continuum from Mixtures to Products: Aggregation under Bias

نویسندگان

  • Amos J. Storkey
  • Zhanxing Zhu
  • Jinli Hu
چکیده

This is a preprint, and does not constitute publication, but is a provided for the benefit of attendees to accompany a talk at the ICML Workshop on Divergence Methods for Probabilistic Inference. If you wish to reference this paper, please reference the final published version. Machine learning models rely heavily on two compositional methods: mixtures and products. Probabilistic aggregation also commonly uses forms of linear opinion pools (which are effectively mixtures), or log opinion pools (which are effectively products). In this paper, we introduce a complete spectrum of compositional methods, Rényi mixtures, that interpolate between mixture models and product models, and hence between log opinion pools and linear opinion pools. We show that these compositional methods are maximum entropy distributions for aggregating information from agents subject to individual biases, with the Rényi divergence parameter dependent on the bias. We also demonstrate practically that Rényi mixtures can provide better performance than log and linear opinion pools, with the optimal limit of log opinion pools when all agents are unbiased and see the same data. We infer that log opinion pools are the most appropriate aggregator for machine learning competitions. We designed, ran and analysed a machine learning Kaggle competition, the results of which confirmed this expectation. Finally we relate Rényi mixtures to recent work on machine learning markets, showing that Rényi aggregators are directly implemented by machine learning markets with isoelastic utilities, and so can result from autonomous self interested decision making by individuals contributing different predictors. Preprint made available for the ICML Workshop on Divergence Methods for Probabilistic Inference, Beijing, China, 2014. Copyright 2014 by the author(s).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Effects of application of olive mill by-products on chickpea yield and their symbiosis with mycorrhizal fungi under arid conditions

This study investigated the effects of soil amendment with olive mill by-products (Jift) on growthof chickpea and their symbiosis with Vesicular arbascular (VA) fungi. A split plot design with threereplications was used, in which soil treatments (methyl bromide fumigated, fungicide, and untreatedcontrol) were assigned to main plots and soil-Jift mixtures(Jift: Soil; 0:10, 1:9, 2:8, 3:7, and 4:6...

متن کامل

Rational Exaggeration in Information Aggregation Games

This paper studies a class of decision-making problems under incomplete information which we call “aggregation games.” It departs from the mainstream information aggregation literature in two respects: information is aggregated by averaging rather than majority rule, and each player selects from a continuum of reports rather than making a binary choice. Each member of a group receives a private...

متن کامل

Performance Evaluation of Dynamic Modulus Predictive Models for Asphalt Mixtures

Dynamic modulus characterizes the viscoelastic behavior of asphalt materials and is the most important input parameter for design and rehabilitation of flexible pavements using Mechanistic–Empirical Pavement Design Guide (MEPDG). Laboratory determination of dynamic modulus is very expensive and time consuming. To overcome this challenge, several predictive models were developed to determine dyn...

متن کامل

Using Imperialist competitive algorithm optimization in multi-response nonlinear programming

The quality of manufactured products is characterized by many controllable quality factors. These factors should be optimized to reach high quality products. In this paper we try to find the controllable factors levels with minimum deviation from the target and with a least variation. To solve the problem a simple aggregation function is used to aggregate the multiple responses functions then a...

متن کامل

The transition energy and the beaming angle of converted LO-mode waves from 100 to 400 kHz through density gradient according to observations of kilometric continuum radiations in the plasmapause

The satellite observations such as the Cluster mission with four-point measurements show some local fluctuations in the density gradient in the vicinity of the plasmapause. These structures are found over a broad range of spatial scales, with a size from 20 to 5000 km. Also, the simultaneous observations of the kilometric continuum by IMAGE (Imager for Magnetopause-to-Aurora Global Exploration)...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014